Improved Transcription of Czech Parliament Speeches by Acoustic and Language Model Adaptation

نویسندگان

Petr Cerva

Jan Nouza

Jan Kolorenc

Petr David

چکیده

The aim of this work is to improve the accuracy of our spoken broadcast transcription system in the task of Czech parliament speeches recognition. To achieve this goal, we propose several approaches for adaptation of both acoustic and language models of our system: a new two step unsupervised speaker adaptation strategy is presented to improve the former model while the latter one is created from a text corpus mixed properly from both general (2.6 GB of Czech newspaper texts) and domain specific data (181 MB of parliament speeches). Our experimental results show that the combination of both adaptation approaches leads to near 30% relative reduction of WER in comparison with the baseline speaker independent (SI) system operating with a general language model.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Speech-to-text technology to transcribe and disclose 100, 000+ hours of bilingual documents from historical Czech and Czechoslovak radio archive

In this paper, we present the outcome of a 4-year project whose ultimate goal is to develop a complex platform that can transcribe, index and make searchable the historical archive of Czech and Czechoslovak Radio. The archive covers 90 years of public broadcasting and contains hundreds of thousands audio documents. The developed modular platform employs our LVCSR system that has to cope with 2 ...

متن کامل

Fully automated system for Czech spoken broadcast transcription with very large (300k+) lexicon

We present a system developed for fully automated processing of Czech spoken broadcast programs. It includes modules for unsupervised segmentation of audio stream, speaker and gender recognition followed by speaker adaptation, and own speech decoder designed for extremely large vocabularies. Compared to our previous results reported in 2004, the new system reduced the WER (evaluated on the Czec...

متن کامل

Using Unsupervised Feature-Based Speaker Adaptation for Improved Transcription of Spoken Archives

This paper deals with unsupervised feature-based speaker adaptation techniques. The goal is to design an optimal adaptation approach for improving the recognition accuracy of a LVCSR system developed for automatic transcription of large archives of spoken Czech (e.g. the archive of the parliament talks, historical archives of Czech broadcast stations, etc.) For this purpose, several modificatio...

متن کامل

Domain Adaptation of a Broadcast News Transcription System for the Portuguese Parliament

The main goal of this work is the adaptation of a broadcast news transcription system to a new domain, namely, the Portuguese Parliament plenary meetings. This paper describes the different domain adaptation steps that lowered our baseline absolute word error rate from 20.1% to 16.1%. These steps include the vocabulary selection, in order to include specific domain terms, language model adaptat...

متن کامل

The ISL 2007 English speech transcription system for european parliament speeches

The project Technology and Corpora for Speech to Speech Translation (TC-STAR) aims at making a break-through in speech-to-speech translation research, significantly reducing the gap between the performance of machines and humans at this task. Technological and scientific progress is driven by periodic, competitive evaluations within the project. In this paper we describe the ISL speech transcri...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره شماره

صفحات -

تاریخ انتشار 2006

Improved Transcription of Czech Parliament Speeches by Acoustic and Language Model Adaptation

نویسندگان

چکیده

منابع مشابه

Speech-to-text technology to transcribe and disclose 100, 000+ hours of bilingual documents from historical Czech and Czechoslovak radio archive

Fully automated system for Czech spoken broadcast transcription with very large (300k+) lexicon

Using Unsupervised Feature-Based Speaker Adaptation for Improved Transcription of Spoken Archives

Domain Adaptation of a Broadcast News Transcription System for the Portuguese Parliament

The ISL 2007 English speech transcription system for european parliament speeches

عنوان ژورنال:

اشتراک گذاری